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Abstract 

We present a survey of existing approaches to relational division in 
rank-aware databases, discuss issues of the present approaches, and out¬ 
line generalizations of several types of classic division-like operations. We 
work in a model which generalizes the Codd model of data by considering 
tuples in relations annotated by ranks, indicating degrees to which tuples 
in relations match queries. The approach utilizes complete residuated lat¬ 
tices as the basic structures of degrees. We argue that unlike the classic 
model, relational divisions are fundamental operations which cannot in 
general be expressed by means of other operations. In addition, we com¬ 
pare the existing and proposed operations and identify those which are 
faithful counterparts of universally quantified queries formulated in rela¬ 
tional calculi. We introduce Pseudo Tuple Calculus in the ranked model 
which is further used to show mutual definability of the various forms of 
divisions presented in the paper. 

1 Introduction 

In this paper, we present a survey and new results in the area of division¬ 
like operations in rank-aware relational models of data. In particular, we are 
interested in models which allow imperfect matches of queries in addition to the 
usual precise yes/no matches of queries. By an “imperfect match” we mean a 
situation where given record in a database does not match a query in the usual 
sense but the record is sufficiently close to a (hypothetical) record that matches 
the query exactly. In many situations, it is desirable to include records with 
imperfect matches in the result of a query and introduce scores which indicate 
the degrees to which the records match the given query. For instance, in a 
database of products, we may query for products with price equal to $1,200. 
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In the traditional understanding, a product sold for $1,198 does not match the 
query. Nevertheless, we may want to include such product in the result and 
annotate it with a high score indicating that the product matches the query 
“almost perfectly” but not fully. In fact, reasoning with imperfect matches is 
inherent to human thinking and human perception of concepts like the proximity 
of values. Rank-aware databases El EH and related models of data aim at 
such reasoning with imperfect matches and are concerned with its formalisation, 
analysis, and implementation in computer database systems. 

Our investigation of division-like operations is motivated by the fact that 
in most of the existing rank-aware approaches to databases, discussion of such 
operations is either completely omitted or focuses only on particular Codd-style 
divisions. Indeed, compared to operations like projections and joins, the current 
rank-aware approaches pay little or no attention to division-like operations. 

There seem to be two reasons for the absence of discussions of divisions in 
rank-aware models: First, a proposed rank-aware model simply omits divisions 
because its authors do not consider such an operation important. Second, the 
authors of a rank-aware model expect a division-like operation to be definable 
by the remaining operations in a similar way as in the classic relational model of 
data. We argue that neither of the points is tenable and divison-like operations 
deserve our attention: 

1) Divisions are important Division-like operations are considered in rela¬ 
tional query systems in order to express queries which take form of particular 
categorical propositions. It is well understood that classic relational queries of 
the form “some Lp is tp'” can be expressed by means of combinations of projections 
and natural joins which are known as semijoins. Analogous queries can also be 
considered in rank-aware approaches with the same meaning except for the fact 
that the results of queries are annotated by scores. Naturally, one should ex¬ 
pect to be able to formulate queries of the form of categorical proposition “all p) 
are in a rank-aware model. In the classic model, such queries are expressed 
by division-like operations. In addition, some variants of the classic relational 
division have a close relationship to the notions of containment (subsethood) 
of relations. From this viewpoint, one should expect that containments and 
divisions in a rank-aware model should both be defined and related as in the 
classic model. Note that division-like operations are also interesting from the 
data-analytical point of view. For instance, concept-forming operators in formal 
concept analysis [23] can be seen as particular relational divisions. 
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2) Divisions in rank-aware models are fundamental operations If a 

rank-aware model contains operations of difference (relational minus), projec¬ 
tion, and natural join, one may argue that a Codd-style division [13] is a defin¬ 
able operation in the ranked model in much the same way as it is dehnable in 
the classic model. While in the classic model, reasonable division-like operations 
can indeed be derived, we show further in the paper that this assumption cannot 
be universally adopted in rank-aware models. Technically, the operation can be 
defined as in the ordinary case but in many cases it lacks the basic properties of 
“reasonable division” and no longer is a faithful representation of queries of the 
form of categorical propositions “all (p are '0”- As a matter of fact, we argue in 
the paper that suitable variants of divisions (or equivalent formalisms) should 
be included as fundamental operations in rank-aware models. 

In this paper we focus on divisions from the perspective of a relational model 
which can be seen as a generalization of the Codd [T3] model of data from 
the point of view of residuated structures of degrees. The basic idea of the 
model is that tuples in relations are annotated by scores indicating degrees to 
which tuples match queries analogously as in I1III12I, cf. also |3T] introducing 
RankSQL and a survey paper |29j . Our model differs in how we approach 
the structures of scores and, consequently, the underlying logic of imperfect 
matches. We use structures of degrees which are recognized by fuzzy logics in the 
narrow sense im (Hill [Ml HZ] and the principle of truth functionality because 
our intention is to develop the model so that particular issues handled in the 
model (like querying and data dependencies) can be analyzed in terms of logical 
deduction in the narrow sense. This is in contrast with various approaches that 
appeared earlier iiEiiiiiii] and utilized techniques from fuzzy sets (in the wide 
sense) where the connection to residuated structures of degrees is not so strict. 
We argue in the paper that the role of residuated structures is crucial for a 
sound treatment of division-like operations. 

Our paper is organized as follows. In Section we recall basic notions of 
our model. In Section we survey existing and propose new approaches to 
division operations in the classic as well as in the graded setting. In Section 
we introduce a query language called Pseudo Tuple Calculus (PTC) that enables 
us to reason about the operations with ease. Finally, in Section we utilize 
PTC to derive further observations on the mutual definability of the division 
operations described in the paper. 
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2 Relational Model Based on Residuated Structures 

In this section, we present a survey of utilized notions from residuated structures 
of degrees and fuzzy relational systems. Furthermore, we introduce the basic 
notions of the generalized relational model of data and its relational algebra [3] . 

2.1 Structures of Degrees 

We use complete residuated lattices as structures of degrees which represent 
scores assigned to tuples and indicating degrees to which tuples match queries. 
A residuated lattice HIMIIIT] is a general algebra [33] of the form 

L = (L, A, V, 0, —t, 0,1) (1) 

such that (L, A, V, 0,1) is a bounded lattice [5| with 0 and 1 being the least and 
the greatest element of L, respectively; (L, 0, 1) is a commutative monoid (i.e., 
® is commutative, associative, and a®l = l(8)a = a for each a S A); ® (a 
multiplication) and —> (a residuum) satisfy the adjointness property: 

a®h<ciiia<b^c (2) 

for each a, 6, c S L where < is the order induced by the lattice structure of L 
(i.e., a < 5 iff a = a A &). A residuated lattice 0 is called complete if its lattice 
part is a complete lattice, i.e., if L contains infima (greatest lower bounds) and 
suprema (least upper bounds) of arbitrary subsets of L. The multiplication 
0 and its adjoint residuum —>■ can be seen as general aggregation functions 
which interpret general “conjunction” and “implication” of scores, respectively. 
That is, if a tuple matches query Qi with a score oi and it also matches query 
Q 2 with a score 02 , then oi 0 02 may be interpreted as the score to which 
the tuple matches the composed conjunctive query “Qi andQ 2 -' This way the 
aggregation function is understood in |21j . In a similar way, oi —> 02 may be 
interpreted as the score to which the tuple matches the composed conditional 
query “if Q\ then Q 2 ” 

A typical choice of a complete residuated lattice L is a structure given by a 
left-continuous triangular norm |30]. That is, L = [0,1] (real unit interval), A 
and V are minimum and maximum (in which case the induced < is the genuine 
ordering of reals), and 0 is a left-continuous triangular norm. The left-continuity 
of ® ensures there is a residuum —>■ satisfying ([^ which is in addition uniquely 
given by 

a —>■ & = \/{c G A I a 0 c < 6}. (3) 
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In words, ^ says that a —?► 5 is the supremum of all c S L such that a(8)c < b (it 
can be shown that a —>■ & is in fact the greatest c € L satisfying such property). 

From pragmatic standpoints, the most important complete residuated lat¬ 
tices are exactly those on the real unit interval given by continuous trian¬ 
gular norms. All such structures can be obtained by constructing ordinal 
sums [1I271ISD] of (isomorphic copies of) three basic pairs of multiplications 
(and their corresponding residua): a^b = max(a -I- 6 — 1, 0) (Lukasiewicz mul¬ 
tiplication), a®b — min(a, b) (Godel or minimum multiplication), a ® b = a ■ b 
(Goguen or product multiplication). 

Remark 1. The role of residuated lattices as general structure of truth de¬ 
grees in truth-functional logics has been recognized by Goguen |25j . Impor¬ 
tant logics based on subclasses of residuated lattices include Hohle’s monoidal 
logic [28], Basic Logic and Monoidal T-norm Logic |20|. Note that the 
truth-functionality is a crucial property which is not present in other models 
which also involve ranks like the probabilistic extensions of the Godd model, 
see m. In fact, the probabilistic databases tackle completely different issues 
and deal with uncertain data which is not our case because the approaches we 
discuss here deal with certain data and imperfect matches of queries. 

An important aspect of the relational model which is relevant to our paper 
is that the classic relational model is based on the classic predicate logic m- 
As a result, finite relations (informally represented by “data tables”) are used to 
represent both the base data and results of queries. In fact, database instances 
(i.e., collections of relations interpreting relational symbols/variables) can be 
seen as predicate structures [35], predicate formulas can be seen as queries, 
and their interpretation in database instances corresponds to query evaluation. 
Thus, the structures of truth values of the classical predicate logic—the Boolean 
algebras, are vital for the model and, loosely speaking, determine laws that hold 
in the relational model. 

The model we use in this paper can be seen as a relational model of data 
which results from the classic one by replacing the Boolean algebras with com¬ 
plete residuated lattices. This change has, of course, its implications. First, 
we shift from structures with only yes/no matches to structures which allow 
us to work with general (intermediate) degrees—this is a desirable property for 
development of a rank-aware model. Second, some laws that hold in the classic 
model are no longer valid (e.g., tertium non datur). The second point shall be 
understood as virtue of the model rather than a vice—note that Basic Logic 
extended by tertium non datur collapses into the classical logic m- In fact. 
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there are no proper fuzzy logics which satisfy tertium non datur. Our rationale 
for using (complete) residuated lattices as the structures of degrees is that they 
represent more general structures than the Boolean algebras which allow us to 
deal with intermediate degrees and are still reasonably strong. 

Remark 2. Let us note that logics based on residuated lattices are used to reason 
about general scores. If 0 and 1 are used as the only scores, the logic collapses 
into the classic Boolean logic which is a desirable property. Also, the structures 
and operations of the generalized model can be implemented inside the classic 
relational model using the ordinary notions of relations on relation schemes and 
additional operations with relations. 

2.2 Attributes, Types, and Ranked Data Tables 

In this section, we present our counterpart to the classic relations on relation 
schemes. We utilize the following notions. We denote by y a (infinite denu¬ 
merable) set of attributes, any finite subset i? C y is called a relation scheme. 
For each attribute y € Y we consider its type Dy which is understood as the 
admissible set of values of the attribute y, see [17] (note that in earlier litera¬ 
ture, types are called domains, cf. [13]). In the paper, we do not refer to types 
explicitly, i.e., whenever we introduce an attribute, we tacitly consider its type 
and for simplicity we assume that attributes with the same name have the same 
type. 

We utilize the usual set-theoretic representation of tuples: A direct product 
riygfl of an i?-indexed system {Dy | y S i?} is a set of all maps 

r: UyGfi (4) 

such that r{y) S Dy for each y £ R. If i? C y is finite, then each r £ IlyGi? 
is called a tuple on relation scheme R, r{y) is called the y-value ofr. For brevity, 
is denoted by Tupl(i?). For SCR and r £ Tupl(i?), we denote by 
r{S) the projection of r onto S, i.e., r{S) C r such that {y,d) £ r{S) for some 
d £ Dy iS y £ S. In particular, r(0) £ Tupl(0) = {0}, i.e., 0 is the only tuple 
on the empty relation scheme. Moreover, if r € Tupl(i?), s £ Tupl(S'), and 
r(R n S) = s(R n S), we call the set-theoretic union r U s the join of tuples r 
and s and denote it by rs. 

The relations (on relation scheme R) which appear in the classic model 
are finite subsets of Tupl(i?). Technically, such subsets can be identified with 
indicator functions which assign 1 to finitely many tuples from Tupl(i?) (to 
those belonging to the relation) and 0 otherwise. Our counterpart to relations on 
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relation schemes result by considering such indicator functions with codomains 
being the set of degrees from complete residuated lattices. 

Definition 1. Let L be a complete residuated lattice, i? be a relation scheme. 
A ranked data table on relation scheme (shortly, an RDT) is any map of the 
form T) : Tupl(i?) —)■ L such that {r G Tupl(i?) | V^r) > 0}, called the answer 
set of V, is finite. The degree ^{r) is called the score of r in V. 

Remark 3. (a) Important special cases of RDTs are represented by RDTs on 
the empty relation scheme. Recall that in the classic model there are only 
two relations on 0, namely the empty relation on 0 (called TABLE_DUM in [17) 1 
and the relation on 0 containing the empty tuple (called TABLE_DEE). In our 
case, all RDTs on the empty scheme are maps of the form V : {0} —>■ L, i.e., 
they are uniquely given by the degree I?(0) G L, i.e., by the degree which is 
assigned to 0 (the empty tuple) by V. Because of this correspondence, for each 
degree a G L, we define a 0 : Tupl(0) -G L as the RDT such that a 0 ( 0 ) = a. 
Hence, in addition to TABLE_DUM (O 0 in our notation) and TABLE_DEE (I 0 in 
our notation) our model admits general DEE-like RDTs for every a G L, leaving 
O 0 and I 0 as two borderline cases. As it is argued in m, special cases of 
divisions which involve TABLE_DUM and TABLE_DEE are important and have been 
often neglected in various approaches to division, which in consequence led to 
divisions with undesirable properties. In our case, the DEE-like tables a<^ play 
analogous important role and shall be taken into account. 

(b) RDTs on non-empty relation schemes can be depicted analogously as 
classic relations on non-empty relation schemes by two-dimensional data tables 
with columns corresponding to attributes and rows corresponding to tuples. In 
addition, each row in the table is annotated by the score of the tuple represented 
by the row (tuples with zero scores are not shown in the table). 

(c) If 'D{r) G {0,1} for all r G Tupl(i?), we call V non-ranked. Clearly, 
non-ranked RDTs are in a one-to-one correspondence with (finite) relations on 
relation schemes in the usual sense. A particular case of a non-ranked table is 
0/j called the empty table and satisfying 0/j(r) = 0 for all r G Tupl(i?). 

2.3 Relational Operations 

By virtue of the close connection to logics based on residuated structures of 
degrees, the rank-aware model we consider admits two basic types of domain 
independent query systems [3]. First, a system based on evaluating predicate 
formulas. Second, a system consisting of relational operations which has the 
same expressive power as the former one. The relational divisions considered in 
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this paper are particular (fundamental or derived) relational operations. In this 
subsection, we recall a fragment of the relational operations we need to cope 
with divisions. 

For Vi and I?2 on the same relation scheme i?, we define H I?2 {intersec¬ 
tion) and Vi U I?2 (union) by 

{VinV2)ir)=Vi{r)AV2ir), ( 5 ) 

(I?iUl?2)(r) =I?i(r)Vl?2(r), ( 6 ) 

for all r € Tupl(R). In words, H and U are defined componentwise using the 
lattice operations A and V in L. 

The natural join in our model is introduced as follows. If is an RDT on 
relation scheme RU S and I?2 is an RDT of relation scheme S U T such that 
i?nS' = i?nr = s HT = 0 (i.e., R, S, and T are pairwise disjoint), then the 
natural join of Vi and I?2 is an RDT on relation scheme RU S UT denoted by 
Vi ex] 1)2 and defined by 

{'Dit<i'D2){rst) ='Di{rs) <SiT>2{st), (7) 

for each r G Tupl(i?), s G Topes'), and t G Tupl(T). Hence, (g) in L acts as 
a conjunctive aggregator which generalizes the classic conjunction appearing in 
the definition of ordinary natural join of relations. If T) is an RDT on i?, the 
projection of V onto S' C R is denoted by nsiR) and defined by 

{tTs{V)){s) = VtGTupl(fl\S) '^(st), (8) 

for each s G Tupl(S). Using projections of tuples onto S, we may write 
equivalently as {tts{T>)){s) = \/{'D{r) \ r{S) = s}. Since (g is distributive over \/, 
we may introduce a semijoin of Vi on R and I?2 on S as 'kji{'Di to I?2) or 
equivalently as Vi to 7 rijns( 7 ^ 2 ) and we denote it Vi kT>2- 

Analogously as in the classic model, semijoins in our model are important 
since they allow us to algebraically express existential queries of the form of 
categorical propositions “some ip is or, in the database terminology HU, 
“some tuples from Vi are matching tuples in I?2” • 

Remark 4. (a) One may check that if all arguments to the above-mentioned 
operations are non-ranked, then the results of relational operations coincide 
with the results of the classic relational operations of union, intersection, natural 
join, and projection [Hin]. 

(b) Let us comment on the role of the general suprema in Q. In predicate 
logics based on residuated structures of degrees [10], general suprema are used 


to interpret existentially quantified formulas. In a more detail, for a formula of 
the form {3x)ip, its truth degree ||(3a:)(/j||M.i; in the L-structure M under the 
evaluation v of object variables is defined as the supremum of all truth degrees 
||</^||m,u) where w{y) = v{y) for each variable y such that y ^ x. Put in words, 
is the least upper bound of all degrees to which ip is true in M 
considering cc as a variable which can be assigned any value from the universe 
of M. Note that if L is the two-element Boolean algebra, this interpretation 
coincides exactly with the usual interpretation of existentially quantified formu¬ 
las and, in particular, ||(3a:)i^||M.i; = 1 iff there is w such that ||(/5 ||m,uj = 1 
and w{y) = v{y) for all y ^ x (i.e., x can be assigned a value which makes 
if true in M). Now, since projections are relational operations which express 
queries formulated by existentially quantified formulas in relational calculi, 
is defined in terms of \/. In words, {Trs{'D))(s) is a degree to which there is a 
tuple in T) whose projection onto S equals to s. 

3 Existing and New Approaches to Division 

In this section, we review several classic approaches to division which appeared 
in the literature on database systems, present their rank-aware counterparts, and 
comment on their relationship to the existing rank-aware or fuzzy approaches in 
databases. The section is structured into subsections which roughly follow the 
structure of m which is arguably the best comparison of division-like operations 
from the point of view of the relational model of data. 

In this section, whenever we say that (a relation or an RDT) V is on scheme 
RS, we mean that it is defined on the scheme RU S such that i?n 5 = 0. 

3.1 Codd-style Division 

Historically, the Codd division is the initial operation in the family of division¬ 
like operations. Its initial pnrpose was technical—to ensure completeness of the 
relational algebra with respect to the relational calculus which allows us to ex¬ 
press queries involving universal quantification. Strictly speaking, its presence 
in the relational algebra is not necessary since in the classical logic, universally 
quantified formulas of the from {\/x)(p can be replaced by formulas -«{3x)-^ip, 
i.e., universal quantifiers are expressible by means of negations and existential 
quantification. Thus, the division is considered as a derived operation which is 
expressed by means of set-theoretic difference (relational counterparts to nega¬ 
tions) and projections (relational counterparts to existential quantification). 
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Namely, for a relation Vi on RS and relation V 2 on S, the Codd division 
'Di -^Codd 2^2 may be introduced [161 

2^1 -^Codd 2?2 = t^r{'Di) \ CXI V 2 ) \ 2?i), (9) 

where ttb., >xi, and \ denote the usual projection, natural join (cross join in this 
particular case), and set-theoretic difference, respectively. The survey chap¬ 
ter m identifies several epistemic issues of ([^. The most important are: 

(i) Unlike semijoins, is restricted to relations on particular schemes, i.e., 
the operation cannot be performed with relations on arbitrary schemes 
which makes it less general (and less useful). 

(ii) The meaning of © does not faithfully correspond to the categorical propo¬ 
sition “all ip are ■i/'”. If (/? is s S 2?2 and V” is rs € 2?i, then 

(Vs)(se2?2^rse2?i) (10) 

is true for all r S Tupl(i?) provided that 2?2 is empty. In contrast, the 
result of ([^ is always a subset of Hence, in general, the meaning 

of is “any r in such that rs € T>i for all s € V 2 ” rather than 

“any r such that rs € Vi for all s G 2?2”, cf. [TS]. As a consequence, (|^ 
is equivalent to 

2?i -G 2?2 = {r G t^r{Pi) \ for all s G 2?2, we have rs G 2?i}, (11) 

where -KjiiRi) can be seen as the range for the division. 

By a direct generalization of in rank-aware approaches, we inherit both 
the issues. In addition, it is questionable how to handle \ in the presence of 
scores. One way to go is to consider {Vi \ 2?2)(r) to be the degree to which r is 
in and is not in I ?2 and express the negation using —>■ and 0, i.e., 

(I?i\I? 2 )(r)= 2 ?i(r)®(I? 2 (r)^ 0 ). (12) 

Although Vi \ V 2 is always finite, it does not fulfill basic properties one would 
expect for a difference. For instance, \ I ?2 = Oi^ does not imply Vi C I ?2 
in general. Alternatively, one may introduce \ as an independent fundamental 
connective in L and induce the difference of RDTs componentwise analogously as 
n or U. For instance, one may use commutative doubly-residuated lattices [55] 
with \ being adjoint to a non-idempotent disjunction. Note that difference¬ 
like operations with relations (with scores) in the database literature are often 


10 


defined analogously as (12), usually on L = [0,1] with 0 being the minimum 
and —?► being the Lukasiewicz implication [8]. The general issue with graded 
style-versions of ([^ is that universal quantifier (interpreted by infima in L) is 
not definable using the existential one (interpreted by suprema in L). 

Most common truth-functional approaches [siiniiTiis] that can be found in 
literature on rank-aware extensions generalize ( |10[ ) by putting 

(Pi -I? 2 )(r) = A. 6 Tupi(S)(^ 2 (s) ^ 'D,{rs)) (13) 


for all r G Tupl(i?) provided that Pi and P 2 are RDTs on schemes RS and S, 
respectively. In our setting, /\ is the operation of infimum in L, and —>■ is the 
residuum in L. The above-cited approaches often use a fixed scale of degrees 
(with L = [0,1]) with —>■ being a general truth function of implication. In 
addition to —)■ which are adjoint to 0 (so-called R-implications), the approaches 
use S-implications [9]. We do not want to endorse this concept here because of 
its marginal role in fuzzy logics in the narrow sense, see |26j and the soundness 
issues regarding S-implications. 


Remark 5. Observe that since r G Tupl(i?), (13) solves issue (m) but this is at 
the expense of losing domain independence. Indeed, if R contains an attribute 
which has a type consisting of infinitely many values then the result Pi G- 
P 2 defined by (13) is infinite which is highly undesirable property from the 
database viewpoint—if a materialization of Pi -G P 2 is necessary in order to 
evaluate a compound query involving the division, the evaluation cannot be 
performed (in finitely many steps). Probably because of this issue, some of the 
graded approaches cited above use (13) assuming that {TTji{'Di)){r) > 0 which, 
unfortunately, introduces (ii) again. 


In our previous work [4], we have used a fundamental domain-dependent 
division operation which is sufficient to establish the equivalence between a 
domain-dependent relational algebra and a domain relational calculus. Recently, 
we have proposed a domain independent variant [3] with explicit range which 
is used to establish the equivalence between a domain-independent relational 
algebra and a domain relational calculus with range declarations. The operation 
is defined as follows. 

Let Pi, P 2 , and P 3 be RDTs on RS, S, and R, respectively. Then, a division 
Pi P 2 of Pi by P 2 which ranges over P 3 is an RDT on R defined by 


(Pi P 2 )(r) = AseTupi(S)(^ 3 (r) 0 (P 2 (s) ^ V,{rs))), (14) 

for each r G Tupl(i?). Clearly, Pi -G®^ P 2 C P 3 . In addition, ( [l4| ) possesses 
further desirable properties. For instance, if P 3 is non-ranked, then Pi-G®^P 2 is 
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the greatest among all T> CV^ such that I? ixi QT^i- Furthermore, if ii = 0 
and I ?3 = l 0 (see Remark , then (14) becomes (the relational representation 
of) the subsethood degree of I ?2 in see [2]. Also, the definition eliminates 
(ii) and is domain independent. 

3.2 Date’s Small Divide (Original and Generalized) 

In order to overcome issue {ii), Date (see m and the references therein) pro¬ 
posed a Small Divide operation. Consider the following relations on relation 
schemes: on R (called the dividend), I ?2 on S (called the divisor), on 

RS (called the mediator). Then, the original version of Small Divide [H] is 

-fdo 2^2 = \ ttr{{V, m V2) \ 'Ds) 

= {r G Dll for all s G I? 2 , we have rs G I? 3 }. (15) 

A graded generalization of (dH) is 

(2^1 -g/do 2 ^ 2 ) (r) = v,{r) 0 A.eTupi(S) ( 2 ^ 2 ( 5 ) ^ V,{rs)) (16) 

with Vi, V 2 , and I ?3 being RDTs on R, S, and RS, respectively. The graded 


variant of the Small Divide and (14) are equivalent under the following condi¬ 
tions: 

Theorem 2. If L is prelinear or divisible, then V 2 = Vi ^ 2 - 

Proof. Either of prelinearity or divisibility ensures that a® (5 Ac) = {a(^b)A{a(^c) 
for all a,b,c G L, see [HEoIEt]. In addition, since I ?2 and I ?3 are finite, in 


both (16) and (14) the infimum is computed using only finitely many degrees 


other than 1 , i.e., the claim follows by distributivity of 0 over infima of finitely 
many degrees which are pairwise distinct. □ 

Note that analogous observation holds if L is arbitrary and T>i is non-ranked. 

Remark 6 . The previous observation has two important consequences: In the 
mainstream fuzzy logics (based on prelinear residuated lattices), graded Small 


Divide and (14) are equivalent. In particular, if L is the two-element Boolean 


algebra, the ranked model becomes the classic one, i.e., this observation pertains 
to the classic relational model. 

In order to cope with issue (z), the original Small Divide has been further 
extended to accomodate relations on more general schemes. Namely, for Ri on 
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RT, I ?2 on SU, and on RSV, Date introduced (TH] a general form of Small 
Divide as follows: 

Vi V 2 = ViK ((7rfl(Di) 7TsiV2)) k V 3 ). (17) 

where k denotes the semidifference, i.e., T) k V = V \ (I) k V). By moment’s 
reflection, we derive that 

Vi D 2 = {rt e Vi I for all s G ^ 3 (^ 2 ), we have rs G nRsiRs)}- (18) 

We may therefore introduce the following operation in the graded setting 

(Di V 2 )irt) = Di(rt) ® A.6Tupi(5)(^s(252))(s) ^ (^fls(2?3))(rs)) (19) 

provided that Vi, I? 2 , and D 3 are RDTs on RT, SU, RSV, respectively. As 
in the classic setting, -Ggsd eliminates both the issues (i) and {ii) mentioned 
earlier. 


3.3 Todd-style Division 

An alternative approach to eliminate issue (i) is the division proposed by Todd, 
cf. [IH]. Written directly in the set notation, 

-^Todd 7^2 = {rt G I for all s G Tupl(5'): if st &V 2 , then rs G Vi}, (20) 


where lA = TrR{'Di) 1 x 1 ttt[V 2 ). Unfortunately, G-xodd and its direct rank-aware 
generalizations inherit the issue {ii). This is caused by the fact that the ranges 


for r and t in ( 20 ) are considered to be the projections of Vi and I? 2 , respectively. 


Interestingly, if U is considered to be the set of all tuples on RT, the graded 
generalization becomes 

(Di G-gTodd 7 ^ 2 )(rt) = AsGTupi(S)(^ 2 (st) ^ 7?i (rs)) (21) 

which is the Kohout-Bandler superproduct composition mm of fuzzy relations 


Vi and I ?2 (in this order). As in the case of (13), G-gXodd is domain dependent 


i.e., even if and I ?2 are hnite, the result of (211 may be infinite which is an 
undesirable property. 


3.4 Date’s Great Divide 

In the same spirit as the Small Divide has been proposed to eliminate the issues 
of the classic Codd division, the Great Divide has been proposed by Date [16] 
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to deal with the issues of the Todd division. Again, we may assume two variants 
of the operation—the original one and the generalized one. For illustration, we 
focus here only on the original variant, the generalized one can be obtained in 
much the same way as in the case of the Small Divide. 

According to nn, for relations on R (called the dividend), 'D2 on T 
(called the divisor), on RS (called the first mediator), and I?4 on ST (called 
the second mediator), we put 


= {Ti M V2) K ((Pi « P4) K Pa). ( 22 ) 

The definition ( | 2 ^ can be equivalently expressed in the set notation as follows: 
Pi P2 = {rt £U\ioi all s S Tupl(S'): if st G P4, then rs G P3}, ( 23 ) 


where = Pi ixi P2. Based on ( 23 ), we may introduce a graded variant -Gggdo 
of the original Great Divide as follows 


(2^1 -Sir T^ 2 ){rt) = Pi(r) 0 P 2 (t) 0 AseTupi(S)(254(si) ^ V^{rs)) 

= (Pi ex V2){rt) 0 AsGTupi(S) (^4 (si) ^ P 3 (r's)) (24) 


with Pi, P2, P3, and P4 being RDTs on R, T, RS, and ST, respectively. 
Loosely speaking, ( 24 ) can be seen as a domain-independent variant of the 
Kohout-Bandler superproduct composition whose range is limited to the natural 
join of Pi and P2. 

Analogously as in the classic case, the graded Great Divide is more general 
than the graded Small Divide. In particular, -Ggsdo can be seen as G-ggdo with 
the divisior being the RDT I0 on the empty relation scheme: 


Corollary 3. We have Pi ^^2 = Pi 


■ ggdo 


Ik 


□ 


As we have already mentioned, ( 24 ) can be generalized in a similar way 
as ( 19 ) to handle RDTs on more general relational schemes. 


3.5 Darwen’s Divide 

Later, Darwen m proposed another division-like operation which is now com¬ 
monly called Darwen’s Divide. This operation is defined similarly as Date’s 
Great Divide but it does not impose any requirements on the relation schemes 
of its arguments. 

The definition is as follows m- For relations Pi on Pi (called the dividend), 
P2 on R2 (called the divisor), P3 on P3 (called the first mediator), and P4 on 
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i?4 (called the second mediator), we put 

^2 = (Pi ^ ^ 2 ) K ((Pi M P 4 ) K Pg). (25) 

Note that the relation scheme of result of Darwen’s Divide is Ri U R2 since i?i 
and i?2 are arbitrary and might have some attributes in common. 

In the proof of the set notation of Darwen’s Divide we utilize the following 
lemma. 

Lemma 4. Consider relations Pi on RS and P 2 on ST such that R, S, T are 
pairwise disjoint (RD S = RCT = S CT = %). For every tuple r S Tupl(i?) 
and s G Tupl(S') we have rs S Pi k P2 iff 

rs S Pi A -n( 3 t G Tupl(r))st G P2 ( 26 ) 

or equivalently 

rs G Pi A -n( 3 s't G Tupl(S'T)) ((rs)( 5 ') = {s't){S) A s't G P2), ( 27 ) 

where s' G Tupl(iS') and t G Tupl(T). 


Proof. The first part follows directly from the definition of semidifference: 

rs G T)I K P 2 rs S Pi \ irjisi'Di ixi P2) 

<;=^ rs G Pi A -"(rs G Pi A s G 7rs(P2)) 

(rs G Pi A -nrs G Pi) V (rs G Pi A -"S G 7rs('l^2)) 

always false 

<;=^ rs G Pi A -"(3t G Tupl(r))st G P 2 . 

The rest follows from the fact that R,S,T are pairwise disjoint and {rs){S) = 
{s't){S) is equivalent to s = s'. □ 


To simplify the notation, for two tuples ri G Tupl(i?i) and r2 G Tupl(i?2) we 
denote by ri 0 r2 the fact that ri and r2 are joinable (ri(i?ini?2) = 'r2(Pini?2))- 
Let us note that the Lemma can be applied to relations on arbitrary 
schemes. For relations Pi on i?i and P2 on R 2 it suffices to put R — Ri\R 2 , S = 
Ri n i?2 and T = i?2 \ 7 ?i. Obviously, relation schemes R,S,T defined in this 
manner are pairwise disjoint and it holds that Ri = RUS and R 2 = SUT. Now 
for ri G Tupl(i?i) using ( [ 27 | we have ri G Pi k P2 iff 

ri G Pi A -"(3r2 G Tupl(i?2)) (^’i 0 ’"2 A r 2 G P2) ( 28 ) 


To put ( 28 ) in words, tuple ri belongs to the result of semidifference of Pi 
and P2 (in this order) iff ri belongs to Pi and there is no tuple r2 from P2 that 
is joinable with ri. 
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Theorem 5. Consider relations Vi on Ri, 'D2 on R2, I?3 on R^, and Vi on R4. 
The definition ( 25 ) can be equivalently expressed in the set notation as follows: 


= ( 29 ) 

{rir2 G U \ for all r4 G Vi’, if rir2 0 r4, then there is r^ G V3: rir4 0 r3}, 

where U = T>\^ I?2 • 

Proof. First, the fact that Vi I?2 C I?i to I?2 follows directly from the 

definition of semidifference. 

For brevity, in the following proof we will denote the join of Vi and I?2 by 
U = Vi \xi 'D2- Now, let n G Tupl(i?i) and r2 G Tupl(i?2) be joinable tuples. 


Using ( 281 we have 


* ddo 

rir2 e {Vi to I?2) ^ {(Ri IX Ra) ^ T's) 

rir2 GU A -n( 3 r' G Tupl(i?i U R4)) (rir2 ^ r' A r' G (Vi X V4) K V3) 

The tuple r' G Tupl(i?i U R4) can be seen as a join of tuples r' = r(r4, 
where G Tupl(i?i) and r4 G Tupl(i?4) such that rj 0 r4. We can replace the 
( 3 r' G Tupl(i?i U R4)) with ( 3 r( G Tupl(i?i))( 3 r 4 G Tupl(i?4)) and additional 
constraint that ensures joinability of and r4. 

It is easy to see that rir2 is joinable with r' if and only if rir2 is joinable 
with all “components” of r' (here with both r( and r4). Symbolically, we have 
rir2 0 r' iff rir2 0 ^ 0 »" 4 - Since both ri,r( G Tupl(i?i), the first 

condition rir2 0 r( is equivalent to ri = r(. Furthermore, second condition 
T’ir2 0 r4 implies ri 0 r 4 . 


Continuing the proof and applying ( 281 to the second semidifference we have 
rir 2 G V4 -Sf ^ V2 

•i=^ rir2 GU A -n( 3 r' G Tupl(i?i U R4)) {rir2 0 r' A r' G (I?i X 1)4) K 
rir2 GU A -"( 3 r( G Tupl(i?i))( 3 r 4 G Tupl(i?4)) 

(r( 0 r4 A r[ = ri A rir2 0 ?’4 A r(r4 G (I?i X 'D4) K V^) 

4 =^ rir2 GU A -"( 3 r 4 G Tupl(i?4)) (rir2 0 r4 A rir4 G (I?i X 'D4) K V^) 

4 =^ rir2 GU A -"( 3 r 4 G Tupl(i?4)) 

(f’U2 0 ^4 A rir4 G X>i X T>4 A -"( 3 r 3 G Tupl(i?3)) (rir4 l^r^ Ar^G V^)) 

Now, rir4 G X I?4 is equivalent to ri G A r4 G I?4 provided that ri 
is joinable with r4, but this is ensured by rir2 0 r4. Furthermore, ri G does 
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not depend on the existence of r4 and can be taken outside the scope of the 
quantiher. We get 


rir 2 GU A “"(3r4 G Tupl(i? 4 )) 

0 ^4 A rir 4 G I?i to I ?4 A -"(Bra S Tupl(i? 3 )) (rir 4 ^ rs A rs G V^)) 
rir 2 GU A -n(ri G I?i A (3r4 G Tupl(i? 4 )) 

(?'ir 2 0 r4 A r4 G I?4 A -"(Bra € Tupl(i?a)) (?’i4‘4 0 ’’s A ra G I?a)) 

always false 

/--V 

(rir2 GU A -"ri G Vi) V (rir2 GlA A -"(3r4 G Tupl(-R4)) 

{rir 2 0 ^4 A r4 G I?4 A -"(Brg G Tupl(i?a)) (?’iT‘ 4 0 ’’3 A ra G I?a))) 
rir 2 G A (Vr 4 G Tupl(i? 4 )) 

(-"(fir 2 0 ^4 A r4 G T>4) V -"-"(3ra G Tupl(i?a)) {rir4 0 ?'3 A ra G I?a)) 
rir 2 GU A (Vr 4 G Tupl(i? 4 )) 

((4’i?’2 0 ?'4 A r4 G Vi) =¥ (3ra G Tupl(i?a)) (^i?’4 0 A ra G I?a)), 


which concludes the proof. 


□ 


Now, based on ( 29 ), we may introduce a graded variant -Ggddo of the Dar- 
wen’s Divide as follows 


(2?i -gdd^ 2?2)(rir2) = 

= 'Di{ri)®'D2(r2)® /\ (^X>4(?’4) V 

r’4GTupl(4i4) r3GTupl(fl3) 

J'ir2!)r4 I'ir45r3 

= {Vi to I?2)(?’1?'2) ® f\ {V 4 {r 4 ) -G \J Da(r 3 )) ( 30 ) 

r4GTupl(4J4) r-3GTupl(i?3) 

rir2Qr4 i'ir4j)r3 

The condition of joinability is not necessary and can be avoided. We can 
put i? 4 \i 2 = RaX {Ri U R2), .R 4 ni 2 = i?4 n (i?i U R2), Rsyii = .R3 \ {Ri U R4) 
and i?3ni4 = .^3 13 {Ri U R4). Obviously, it holds that i?4\i2 0 i?4ni2 = 0 and 
i?4\i2 U i?4ni2 = R4- The same holds for i?3\i4 and i? 3 ni 4 . Now, denote by 
^12 = (?’i?'2)(.R4ni2) the projection of tuple rir2 onto i?4ni2 (he. onto common 
attributes of R4 and i?i U i?2- Considering r^ G Tupl(i?4\i2) we get r4 = rf2?'4- 
Observe, that tuples r^2 and r^ are always joinable since i?4\i2 H i?4ni2 = 0 - We 
have expressed the tuple r4 without any need for joinability condition and we 
can remove the condition from the infimum operation. 
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We can now proceed to the condition in supremum. Note, that 

riTi = = ri(rir2)(i?4ni2)?’4 = ri(»"2)(-R4n2)?’4 = rirl^r'^^ 

Again, by = (rir4)(i?3ni4) = (rir2^r4)(i?3ni4) we denote the projection 

A Q 

of the tuple in question onto -R 3 ni 4 - For r'^ € Tupl(i? 3 \i 4 ) we get r 3 = r 44 r 3 . 

A Q 

Using similar argument, and rg are always joinable. 

Putting both observations together we finally get 

(2^1 T^ 2 ){nr,) = 

= {Vi CXI I?2)(?'l?'2) 0 A (^4(r4) ^ V ^3(^-3)) 

r4eTupl(i?4) r-3eTupl(i?3) 

rir 2 ^ri rir4Qr3 

= (I?i«I?2)(rir2)® /\ (V44tr',)^ V ^3(rf4^3)) (31) 

r^eTupl(i?4yi2) r^eTupl(i?3yi4) 

= (I?i«I?2)(rir2)® /\ (l?4(r?2M)^^i?3n44(2?3)(r?4)) (32) 

<6Tupl(i?4yi2) 

Graded Date’s Great and Small Divide can be easily expressed by the graded 
version of Darwen’s Divide in the following way. 


Theorem 6. For relations on schemes that conform to requirements for Great 
Divide, precisely for relations Vi on R, I ?2 on T, on RS, and D 4 on ST, 
we have 


Vi 


_^X>3,X>4 
• ggdo 


I?2 = Dl 


_^X>3,X>4 
* gddo 


V 2 . 


Proof. For relations Di on R, D 2 on T, on RS and D 4 on ST, we have 


(2^1 T^2){rt) = 

= (X>i cx X>2)(rir2) G A ^ ^RSn{RuST){T’ 3 ){rll)^ 

rieTupl(ST\(fiUT)) 

= (Vi cx I?2)(rt) G A (^4(st) -> Dsirsfj 

seTupics) 

= (2^1 -SA T^2){rt). 


□ 


Corollary 7. For relations on schemes that conform to requirements for Small 
Divide, precisely for relations Di on R, D 2 on S and D 3 on RS, we have 


-Di -^sdo 


2^2 = Di l0- 


□ 
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4 Pseudo Tuple Relational Calculus 

In this section, we present a query language we use in this paper for easier 
reasoning about the relational algebra operations. The Pseudo Tuple Calculus 
(shortly, PTC) is similar to the ordinary tuple calculus, however, it provides 
more convenient way to reason about relational algebra expressions in the pres¬ 
ence of scores. In the next section we use the PTC to show mutual relationships 
among the division operations. 

4.1 PTC-expressions and their evalnation 

Every PTC-expression T(ri,...,r„) of Pseudo Tuple Calculus is associated 
with a finite set of free tuple variables ri,...,r„ that appear in the PTC- 
expression. For each tuple variable we consider its relation scheme Ri. We 
assume that tuple variables with the same name have the same relation scheme. 
The relation scheme Rj- of PTC-expression T(iri,..., r„) is given by the union 
of relation schemes of the tuple variables Rj- = IJ^i 

Since we do not utilize any disjunctive operations in this paper we define here 
only a fragment of the Pseudo Tuple Calculus without the corresponding dis¬ 
junctive expressions. For the same reason we omit the treatment of restrictions 
as well. 

4.1.1 Syntax of PTC-expressions 

The PTC-expressions are defined inductively as follows. 

1. if F is a relational algebra expression (shortly, RA-expression) on relation 
scheme R and ri,..., r„ are tuple variables on i?i,..., such that R = 

then A(ri,...,r„) is an (atomic) PTC-expression on relation 
scheme R. 

In order to keep our notation simple, we abbreviate finite sets of tuple 
variables as r = {ri,..., r„} and their corresponding relation schemes as 
i?r = Ur=i simplified notation, the (atomic) PTC-expression 

E{ri, ..., r„) becomes E{r). 

2. if 7i(ri) and 72(r2) are PTC-expressions on i?ri and Rr^ respectively, 
then (7i(ri) o72(i’2))(ri U Y 2 ) is PTC-expression on R^^ U R,.^, where o 
is one of the following symbols 0, A,—>. Note that ri U r 2 is well-defined 
since we assume that tuple variables with the same name have the same 
relation scheme. 
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To simplify notation, we do not have to explicitly mention the set ri U r 2 
since it can be easily deduced from the form of the subexpressions. Thus, 
the above mentioned PTC-expression becomes 7i(ri) o72(i"2)- In more 
complex expressions we utilize outer parentheses to avoid ambiguity in 
the usual way. 

3. if T(r) is PTC-expression on then (VT(r))(r) and (AT(r))(r) are 
PTC-expressions on R^. In simplihed notation we have VT(r) and AT(r). 

4. if T(ri U r 2 ) is PTC-expression on R^^ U i?r 2 such that R^ C i?r 2 = 0 

then U r 2 )) (i’ 2 ) and >' 2 )) (i’ 2 ) are PTC-expressions 

on Rr^- 

For aesthetic reasons we will denote the set ri U r 2 by ri,r 2 . In the 
simplified notation we get Vn and An 'r(ri,r 2 ). 

4.1.2 Semantics of PTC-expressions 

The evaluation of PTC-expressions is based on the notion of a database in¬ 
stance X>. Loosely speaking, a database instance assigns appropriate relations 
to relation symbols from a database scheme—database instance can be seen as 
a snapshot of all base relations that we have in some database. Relations in 
database naturally change in time, however the database instance is fixed as it 
reflects the state of the database in a given point of time. We tacitly assume 
that the database scheme is clear from the context. 

Furthermore, we utilize the notion of extended active domains. First, we 
define the active domain adom{y, V) for the given attribute y and relation V as 
a projection of T) onto y where all tuples with non-zero scores have their score 
set to one. We denote by Pf {i € /) relations from the given database instance 
X> whose relation schemes contain the attribute y ({j/} C Ri). The extended 
active domain eadomP{y) for the given database instance and attribute y is 
defined as 

eadom^{y) = adom{y,'D\). 

For the entire relation scheme R = {7/1, ...,?/„} we define the extended active 
domain as 

eadorn^ = eadora^{yi) CO • • • [xi eadom®(j/„). 

It is easy to see that the eadorn^ contains every tuple on relation scheme R 
that can be built from all values of respective domains that are available in the 
database instance in question. The extended active domain can be seen as a 
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finite universe of tuples for the given database instance and relation scheme if 
we do not allow introduction of new domain values (by singleton relations). 

Remark 7. As an aside, let us mention that it is easy to modify the definition 
of extended active domain to incorporate new values introduced by singleton 
relations. Since RA-expressions are finite, the number of new values is finite as 
well. Before obtaining extended active domain eadorri^ by joining all extended 
active domains eadom{yi) for attributes yt G R {i G {!,...,n}) it suffices to 
unify each eadom{yi) with a (finite) set of new values whose domain coincides 
with the domain of attribute 

Now, we define the evaluation of PTC-expressions in database instances. 
Suppose we have the database instance T> and a PTC-expression T(r), where 
r = {ri,..., r„} such that each tuple variable is on relation scheme Ri. By 
evaluating T(r) in X> we obtain a relation on relation scheme R = ljr=i 
For any tuple r ^ eadom'^ we put (r) = 0. In other words, the relation 
may contain only tuples from eadorn^. For each tuple r G eadom^ we define 
its score in the relation as follows. 

Any tuple r G eadom^ induces a valuation of the tuple variables ri,..., 
from the PTC-expression. The valuation assigns each variable the projection 
of tuple r onto the relation scheme Ri of the variable in question, symbolically 
||ri|jr = r{Ri). We denote the join of valuated tuple variables ||ri||j. • • • ||r„||f. as 
||r||j.. It is easy to see that ||r||r = r. In general, for a set of tuple variables r' 
such that r' C r with relation scheme Rr' Q R it holds that ||r'||j. = r{Rr'). We 
define the score (||r||r) of tuple ||r|jr in the relation as follows. According 
to the form of PTC-expression we distinguish the following cases 

1 . if T(r) is E{r), we first evaluate the RA-expression E in the database 
instance X> according to RA-expression evaluation rules ([3]) and denote 
the resulting relation as E'^ , then we set T^(||r|jr) = A^(||r||r), 

2. if T(r) is 7i(ri) o72(r2), where o is on of the following symbols (g), A,— >■, 

and r = riUr 2 , first we get the scores T|^(||ri||r) and l^{\\Y 2 \\r) with the 
valuation induced by r. Then we set 7~^(||r||r) = (||ri||j.) oT^ (Iir 2 ||r), 

where o is one of the following operations g,A,— >■. 


3. if T(r) is VT'(r) or AT'(r) we get the score T'^(||r||,.). 
If T(r) is VT'(r) we set 
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If T(r) is AT'(r) we set 


1 if T'^dlrlL) = 1, 

I 0 otherwise. 

4. if T(r) is Vri '' 2 ) 01' An'' 2 ) where r = v^, first we get the 

scores T'^(||ri U Y2\\rr') with the valuation induced by the join of tuples 
r and r' for every r' € eadom^^ . Note that the tuples r and r' are always 
joinable as the relation schemes i?n and i?r 2 = R are disjoint (from the 
definition of PTC-expression). Since eadorn^ is finite, we obtain a finite 
set of scores {T'^(||ri U r 2 ||rr') I r' € eadom^ }. 

If T(r) is Vn T'(ri,r 2 ) we set 

(lAllr) = \J{T''^{\\yx U r 2 \\rr') \ r' € eadom^^^}. 

If T(r) is An T'(ri, r 2 ) we set 

'A^(IAIIr) = Al'^'^dAi U ralirr') I r' € eadom^^J. 

4.1.3 Splitting principle 

Consider a PTC-expression T(..., r,...) such that the tuple variable r is on 
relation scheme R. If we replace the tuple variable r with two (or more) fresh 
tuple variables ri, r 2 on Ri and R 2 such that i?i U i ?2 = R, we obtain a PTC- 
expression T'(. • ■, iTi, r 2 ,...) that differs only in the set of free variables. Despite 
being different on the syntactic level it is straightforward to see that for any 
database instance X> we have — T'^ . 

From the semantic point of view, we are free to “split” free tuple variables 
and “join” them back without changing the meaning of the PTC-expression. 
We call this “the splitting principle”. 

4.2 Equivalence of PTC and Relational Algebra 

In this section we show that the Pseudo Tuple Calculus and Relational Algebra 
are equivalent. First, observe that if we evaluate any RA-expression E on rela¬ 
tion scheme i? in a database instance X>, the relation may contain tuples 
from eadorn^ only, since eadorn^ consists of all tuples that can possibly be 
built from the values available in the database instance, i. e. we have 

= 0 whenever r ^ eadom^. (33) 

Using this observation we can easily prove the following theorem. 
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Theorem 8. For any RA-expression E on relation scheme R there is a PTC- 
expression T(r) on R such that for any database instance we have E'^ ir) = 
7~^(r) for all r G Tupl(i?). 


Proof. Since any RA-expression is directly an (atomic) PTC-expression we can 
take A(r) with a single tuple variable r on relation scheme R as the sought 
PTC-expression T(r) . From the definition of PTC-expression evaluation and the 
observation (331 we conclude that E'^ (r) = (r) holds for all r G Tupl(i?). □ 


It follows that the Pseudo Tuple Calculus is at least as powerful as the 
Relational Algebra. Before proving the converse theorem we need one more 
observation. Recall that the relation eadorn^ plays an important role in PTC- 
expression evaluation as it serves the purpose of an implicit range (or universe) 
for evaluation. Since evaluation of RA-expressions is unconstrained and takes 
all tuples in account we need to be able to construct a RA-expression Er that 
will evaluate to eadom'^ and will act as an explicit range for evaluation of 
RA-expressions. 

It is easy to see that the active domain adom{y,'D) for the given attribute 
y and relation V can be computed by evaluating the RA-expression Ay{D) = 
7r{j,}(VD) in database instance X>, where D is a relation symbol evaluated to 
relation D by the database instance. For the extended active domain eadomP (y) 
for an attribute y the RA-expression is Ey = IJig/ ), where are relation 

symbols whose relation scheme contains attribute y and the database instance 
interprets each relation symbol Df as relation T>f. Finally, we get the extended 
active domain eadorn^ for scheme R = {yi, ..., j/„} by evaluating Er = fy, ixi 
• • • CXI Ey^ in database instance X>. In other words we have eadoni^ = E^. 


Remark 8. As an aside, if we use the modified definition of extended active 
domain that allows introduction of new values by singleton relations, we need to 
modify the previous definition of Ey to reflect the extended meaning of eadom'^. 
The definition becomes Ey = UiG/ (-^y(^F)) UU^=i[2/- where Df are relation 
symbols whose relation scheme contains attribute y and Cj are symbols denoting 
new values from the domain of attribute y such that the database instance 
interprets each relation symbol as relation and each symbol Cj as the new 
value from the respective domain. 

Theorem 9. For any PTC-expression T(r) with r = {ri,..., r„}, where the 
tuple variables are on relation schemes Ri, there is a RA-expression F on 
relation scheme R = ljr=i such that for any database instance T) we have 
F'^{r) = T^{r) for all r G Tupl(i?). 
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Proof. The theorem is proved by induction on the complexity of the PTC- 
expression. In each step, we show the RA-expression F that forms the counter¬ 
part to the PTC-expression T(r) in question. Furthermore, we show that the 
results of evaluating both RA- and PTC-expression coincide, i. e. the relations 
and have the same relation scheme and contain the same tuples. Re¬ 
call that for a set of tuple variables r' on the relation scheme i?r' and a tuple 
r e Tupl(i?) such that i?r' C i? we have ||r'||r = r(i?r')- 

Let us have a PTC-expression T(r), where r = {ri,..., r„} such that each 
tuple variable is on relation scheme Ri. The relation scheme of the relation 
is R = ljr=i We obtain the equivalent RA-expression F as follows. 

1. If T(r) is E{r), the sought RA-expression F is E. 

Since the relation scheme of F is R, the relations and F^ have the 
same relation scheme. For any tuple r G eadorti^ we have 

r^(r)=r^(||r||,)=£;^(|!r||,) = A^(r) 


from the definition of PTC-expression evaluation. 


From (33) it follows that for all tuples r ^ eadorn^ we have F'^ir) = 0. 
Together, we have (r) = F'^{r) for all tuples r G Tupl(i?). 


2. If T(r) is 7i(ri) o72(i’ 2), where o is on of the following symbols 0, A,—>■, 
r = ri U r 2 and R = Rn U Rr 21 then from the induction hypothesis we 
have RA-expressions Ex on relation scheme R^^ and E 2 on relation scheme 
Rra corresponding to PTC-subexpressions 7i(ri) and 72 (r 2 ), respectively, 
such that Tj^(ri) = E'^[ri) and T^{r 2 ) = E^{r 2 ) for all ri G Tupl(i?i) 
and r 2 G Tupl(i? 2 )- 

According to the symbol o we distinguish three cases: 


(a) If o is 0, then we put E = Ei t><i E 2 . 

The relation scheme of E is Rn U Rra as required. We have 


r^(r)=r^(||r||,) 

= 7i^(||ri||,)®7?"(||r2||.) 

= 7?{r{R,,))®T?{r{Rr.)) 
= E^ir{RrJ)^Ef{r{RrJ) 
= {Ext^E2)'^{r{R,MRr.)) 
= E^{r) 
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for all tuples r € eadorn^. 

For any tuple r ^ eadorn^ either or both of r{Rr^) ^ eadorn^^ 
and r(i?r 2 ) ^ eadorn^^ must hold, otherwise we would arrive at 
contradiction. Without loss of generality let us assume that ^ 

eadorn^^ . Then we have 7^{r{Rr^)) = 0 and from the induction 
hypothesis we also have Ef {r{Rr^)) = 0. From the properties of 0 
we conclude that F'^{r) = 0 for r ^ eadom^. 

(b) If o is A, then we put F = {Ei cc ) n {E 2 cxi ). 

Since both Ei t<i and E 2 ixi £r^^ are on relation scheme Rr^ U 
i?r 2 ) the RA-expression F is well-defined and its relation scheme 
matches the relation scheme of the PTC-expression. 

Now, observe that for any tuple r G eadorn^ the following holds 

Ef (r(i?rj) = Ef (r(Rri)) ® eadorn^ (?'(-Rr2)) = {Ei ex £r {r) 

V y 

= 1 

Dually, it holds for E'^ as well. To put the in words, we can “extend” 
the relation scheme of some relation without changing the scores of 
tuples in this relation. Hence, we have 

T^(r)=r^(||r||,) 

= 7i^(l|ri||.)A7?"(||r2||.) 

= TY{r{R..))^l?{r{R,Y) 

= Ef{r{R,J)hEf{r{R,,)) 

= (Ai M £r^^ f^J)^ {E2 M £r^^ (r) 

= {{El ex £r^^ ) n (A2 IX £r^^ ))® (r) 

= F^{r) 

for all tuples r G eadomY- 

For any tuple r ^ eadomY, use the same argument as for the case 
with (g) concluding that F'^{r) = 0 for r ^ eadorn^. 

(c) If o is —then we put F = {Ei x £ji^^ ) {E 2 x £r^^). 

Since all Ei x £r^^ , E 2 x £r^_^ , and £r are on relation scheme 
i?ri Ui?r 2 j the RA-expression F is well-defined and its relation scheme 
matches the relation scheme of the PTC-expression. 

Observe that since 7~^(r) > 0 only for tuples r G eadom^ and 
£^{r) = eadomY{r) = 1 for any tuple r G eadomY, it holds that 
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7^ (r) = (r) 0 7"^ (r). Using previous observations we have 

r^(r)=£:|’(r)®r^(||r||,) 

= ^?W®(7;®(||ri||.)^7?"(|lr2||.)) 

= W 0 {7?{r{Rr,)) ^ 7?{r{Rr,))) 

= £f{T) ® {E-^{r{R.,)) ^ Ef{r{Rr,))) 

= £^ir) 0 ((Ui M ^ (U 2 M )®(r)) 

= ((Ui M (U 2 M ))® (r) 

= E^ir) 

for all tuples r G eadorn^. For any tuple r ^ eadorn^ we have £^ (r) = 0. 
From the properties of 0 we conclude that F'^{r) = 0 for r ^ eadorn^. 

3. If T(r) is VT'(r) or AT'(r), then from the induction hypothesis we 
have a RA-expression E on relation scheme R corresponding to PTC- 
subexpression T'(r), such that {r) = E'^{r) for all r G Tupl(i?). 

We put F = VE or F = AE, respectively. 

In both cases, the relation scheme of A is i? as required. Assuming that 
the symbol □ denotes V or A we have 

T^(r) = T^(||r||,) = □r'®(||r||,) = □r'""(r) = DE^ir) = E^{r), 

for all r € eadorn^. 

For any tuple r ^ eadorn^ we have F''^{r) = 0 and from the induction 
hypothesis we also have E'^{r) = 0. From the definition of V or A we 
conclude that F'^{r) = 0 for r ^ eadorri^. 

4. If T(r) is Vri 7”'(ri,r2) or {ri,V 2 ) where r = r 2 , i? = i?r 2 and 

i?riFi?r 2 = then from the induction hypothesis we have a RA-expression 
E on relation scheme i?ri U R corresponding to the PTC-subexpression 
T'(ri,r 2 ), such that T'^(rr') = E'^{rr') for all r G Tupl(i?) and r' G 
Tupl(i?ri)- Note that tuples r and r' are always joinable since the relation 
schemes i?ri and R are disjoint. 

We distinguish two cases: 

(a) if T(r) is T'(ri,r 2 ), then we put F = ttr{E). 
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The relation scheme of F is i? as required. We have 


T^{T) = 7^{\\v\\r) 

= V{7”'^(l|i'i u r2||rr') I r' e eadorn^^^} 

= \/{T''^{rr') I r' G eadorn^^^} 

= \J{E'^{rr') I r' G eadorn^^^} 

= \J{E'^{rr') I r' G Tupl(i?rJ} 

= {7rn{E) f (r)=F®(r) 

for all r G eadorn^. 

Observe that extending the range of r' from eadorn^^ to Tupl(i?ri) 
cannot change the score \/{F^(rr')} since for r' ^ eadoni^^ we 
have T'^(rr') = 0 and thus E'^{rr') = 0 for any r G Tupl(i?). 
Furthermore, for any a G F it holds that a V 0 = a. Hence, we have 

V{F^(rr') I r' G eadom® ^ } = \J{E'^{rr') \ r' G Tupl(i?ri)}. 

Now we show that F'^{r) = 0 for all r ^ eadorn^. For any r ^ 
eadom^ we have E'^{rr') = 0 and thus E'^ir) = \/{0: Oj • • ■} = 0- 

(b) if r (r) is An 'T' (r 1 , r 2 ), we put F = F . 

The relation scheme of F is F as required. We have 

T^(r)=T^(|lr||A 

= Al'^'^dki Uralirr') I r' G eadorri^^^} 

= I € eadom® 

= /\{E'^{rr') I r' G eadorn^^^} 

= ir') ^ I r' G Tupl(Fn)} 

= A{£^{r) ® (/) ^ F^(r/)) I r' G Tupl(Fn)} 

= (F ^^H£:^^A®(r) = F®(r) 
for all r G eadorn^. 

Note that extending the range of r' from eadomZ to Tupl(Fn) 
cannot change the final score of r, since for any r' ^ eadoni^ we 
have (F) —>■ E'^{rr') = 1 and it holds that a A 1 = a for any 
Cl € L. 

For any tuple r ^ eadorn^ we have = 0. Hence, we have 

E'^{r) = A{0j 0,...} = 0 for r ^ eadorn^. 
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Observe that instead of using (14) we can alternatively use Date’s 
Small Divide and put F' = Sr since it holds that 


r®(r)=r^(||r||.) 

= Al'^'^dki u r2||rr') I r' G eadorri^^^} 

= A{T'®(rr') I r' G eadom^^^} 

= f\{E'^{rr') I r' G eadorri^^^} 

= ir') ^ E-^irr') \ r' G Tnpl(i?rJ} 

= ^R{r) ® M{£R,y) E’^irr')) I r' G Tupl(i?rJ} 


for all r G eadorriR. 

For any tuple r ^ eadorriR we have (r) = 0. From the properties 
of 0 we conclude that F''^{r) = 0 for any r ^ eadorriR. 


□ 


5 More on Relationships of Division Operations 


In this section we use the Pseudo Tuple Calculus (PTC) to show further rela¬ 
tionships of the division operations presented in this paper. We utilize the PTC 
in the following way. Let us have an relational operation op that accepts in¬ 
put relations I?i,..., T’n on relation schemes i?i,..., and its output relation 
is on relation scheme R. For the input relations we consider relation symbols 
Di,..., on the respective relation schemes Ri,..., i?„. Note that the rela¬ 
tion symbols are themselves RA-expressions. Now using the relation symbols 
we construct a PTC-expression T(r) on R that is semantically equivalent to 
the operation in question. By semantical equivalence we mean that if we evalu¬ 
ate the PTC-expression T(r) in a database instance X> that maps the relation 
symbols to the input relations, i. e. we have - we get 

that 

op{Vi,...,Vn){r)=T^{r) 


for all r G Tupl(i?). Note that this construction does not depend on the actual 
content of the input relations. Furthermore we apply the Theorem to trans¬ 
form the PTC-expression T(r) to an equivalent RA-expression that uses only 
the fundamental operations of the algebra and obtain the requested relationship. 

We give an example to illustrate the notion of semantical equivalence. Con¬ 
sider the division operation defined by (14), i.e., for RDTs I?i, I? 2 , and Vs on 
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RS, S, and R, respectively, the division T>i T >2 of T>i by T >2 which ranges 
over T >3 is an RDT on R defined by 

{V, V 2 ){r) = As6Tupi(S)(2^3(r) 0 (I? 2 (s) ^ 2?i(rs))), 

for each r S Tupl(i?). Consider relation symbols Di, D 2 and D 3 on RS, S, and 
R, respectively. Then the PTC-expression 

T{r) = AsC^3(ir) O ([D2($) Di(r$))), 

is semantically equivalent to the division operation. More precisely, for a database 
instance T> such that Df = Vi, Df* = I? 2 , and D® = we have 

(Pi P 2 )(r) =T^(r) 

for all r € Tupl(i?). Now, we are ready to show the relationships among the 
division operations. 


Theorem 10. Let Pi, P 2 , and P 3 be RDTs on RS, S, and R, respeetively, 
and let -^gsdo be Date’s Small Divide. For the division operation defined by (141 
we have 

(Pi P2)(r) = (f® 
for all r G Tupl(i?) where 


= P 3 IX] ((P 2 ixi Sf) x>i) 


and the extended active domains , and £'^g contain tuples built only from 

the values from relations Pi,P 2 , and P 3 . 


Proof. First, using the relation symbols Di,D 2 , and D 3 , corresponding to the 
input relations Pi,P 2 , and P 3 , we construct a PTC-expression T(r) that is 
semantically equivalent to the division operation. We have 


= As('D 3 (ir) O (1D2($) ^ Di(r$))) 

and it holds that (Pi P 2 )(r) = {r) for all r € Tupl(i?) in any database 

instance P that maps the relation symbols to their respective input relations. 
According to the Theorem there is an equivalent RA-expression F such that 
T^fr) = F'^{r) for all r € Tupl(i?). The sought RA-expression F is 

F = £r -^fsdo 

where 

p = D3 CXI ((D2 CXI £rs) —(Di [XI £s)). 


29 



By evaluating E in the database instance T> that maps the relation symbols to 
their respective input relations we get a relation 

E'^ =V3iXi ((X>2 ixJ £rs) (Pi ^f))- 

The most simple database instance that maps the relation symbols to their 
respective input relations contains just the relations and The 

relations £rs,£^, and £^, obtained by evaluating £rSt£s^ and £r, in such 
database instance therefore contain tuples built only from the values from rela¬ 
tions and as required. 

It can be easily checked that even if the database instance contained more 
relations and thus the extended active domains £'rsP'F ^ contained 

more tuples built from values from other relations these additional tuples do 
not change the result of evaluating the RA-expression F. It is safe to build the 
relations £rsP^^ £ri only from the values from relations I?i,I?2, and 

From the properties of cxi and the fact that £'g contains the projection of 
relation Vi to S and the relation I?2, "we can further simplify the form of the 
relation E^ to 

= X>3 to ((X>2 CXI £r ) Vi). 

Putting all things together we have 

p, ^ (£^^^sl£s)(r) 

for all r G Tupl(i?) with the relations E'^ and £'^, £f ,£'rs defined as above. □ 


Theorem 11 . Let Di, £>2, and £>3 be RDTs on R, S, and RS, respeetively, 
and let -G be the division operation defined by ( 14 ). For Date’s Small Divide we 
have 

pi 2 ^ 2 )(r) = p3 P" £f)){r), 

for all r G Tupl(i?) where 

E'^ = {(V2>^£^) V3) 

and the extended active domains £^, £^, and£^g contain tuples built only from 
the values from relations I?i,I?2) and D3. 


Proof. Use similar arguments as in the proof of Theorem 10 


□ 


Theorem 12 . Let Di, D2, Rs, and D4 be RDTs on Ri, R2, R3, and R4, re¬ 
spectively, and let -G be the division operation defined by ( 14 ). For Darwen’s 
Divide we have 

pi -^dd^ 2^2) (r) = ((Pi « V2) (A® £rOP). 
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for all r € Tupl(i?i U i?2) where 

= ((I ?4 S'g,) ^ £^J), 

R[ = (i?4 n (i?i u R2)) u (i?i n R3), 

i?2 = R4 \ (Rl U R2), 

= i?3 n (i?i u i?4), 

i?4 = i?4 u {Ri n i?3) 

and the extended active domains contain tuples built only from the values from 

relations Vi,1)2,Rs, andVi. 

Proof. As in the previous proofs, we construct PTC-expression T(r) that is 
semantically equivalent to the division operation defined by Using the 

relation symbols Di,D2,D3, and D4 that correspond to the input relations 
I?i,I? 2 , 253 , and £>4 we get 

r(r) = (Di cx] D2)(r)(g) Ar;(^4(rf,r;) ^ r'b)) 

where 

• r is on a relation scheme i?i U i?2, 

• r{j is on i?2 = A4 \ (i?i U R2), 

• Tf is on i?4 n (i?i U R2), 

• i?3 = A3 n (Ai U A4), 

• r" is on (Ai U {R2 n A4)) n A3, 

• r(j' is on A2 n A3 

such that each set of tuple variables contains one tuple variable for each attribute 
in the relation schema of the corresponding subexpression. For instance, the set 
of tuple variables r can be characterized as r = | y G Ai U A2}. 

According to the Theoremj^there is an equivalent RA-expression F such that 
T^{r) = F'^{r) for all r G Tupl(A). Again, the database instance X> should 
map each relation symbol to its corresponding input relation. In order to find 
the RA-expression F, we first find the RA-expression E that corresponds to the 
PTC-subexpression D4(rf, rj^) —>• 7 r 4 ^AlD 3 )(rA fb)- ^he sought RA-expression is 

E= (^(D4 


31 


where i?4 = i?4 U i?3 = i?4 U {R3 n (i?i U R4)) = R4U {Ri n R3). 

Now, we are ready to find the RA-expression F that corresponds to the 
whole PTC-expression T(r). We have 

where R[ = i?4 \ i?2 = (^4 ^ (^1 U ^2)) U (i?i n R3). Since it holds that 
R'l Q {Ri U R2) the relation scheme of F is U R2 as required. 

The rest of the proof is clear. □ 


In the previous chapters, we have already shown that Date’s Small Divide is 
a special case of Date’s Great Divide which is in turn a special case of Darwen’s 
Divide. Furthermore, we have shown that if the L is prelinear or divisible, then 
there is a simple correspondence between Date’s Small Divide and the division 
operation defined by ( 14 ). 

In this chapter we have shown that they are equivalent regardless of the 
properties of L. We have also shown that Darwen’s Divide can be expressed by 
the division operation defined by ( 14 ). As a consequence we get the equivalence 
of all domain-independent division operations presented in this paper. Further¬ 
more we have an exact way to express one division using the other. Therefore 
we can summarize the observations as follows: 


Corollary 13 . All domain-independent division operations presented in this 
paper are equivalent. □ 

This result solves an open question concerning the relationship of Date’s 
Great Divide and Darwen’s Divide in the classic setting, see [111 page 187 ]. 


6 Conclusion 


We have presented a survey of graded generalizations of classic division-like op¬ 
erations in a rank-aware model of data. We have focused on generalizing variants 
of division-like operations which are neglected by other rank-aware approaches 


in databases. In our model we assume that ( 141 is a fundamental operation. 


Under this assumption, we have shown that all the graded generalizations of 
the classic division operations we have studied in this paper are derived opera¬ 


tions. That is, considering the original graded division ( 14 ) as the fundamental 


division, i.e., including it in the relational algebra, all the other divisions ( 16 ) 


( 19 ), ( 24 ) and ( 32 ), are derived operations in our model. Furthermore, using 


the Pseudo Tuple Calculus (PTC), we have shown that the various variants of 
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the division operations are mutually definable. Interestingly, some of our obser¬ 
vations we have made on the general level (considering L as a general complete 
residuated lattice) pertain to the classic model—when L is considered as the 
two-element Boolean algebra. For instance, we have shown that Date’s Great 
Divide and Darwen’s Divide are mutually definable. This result solves an open 
question that was stated by Date in [TtI page 187]. 

Future research in the area may include considerations on the role of fun¬ 
damental and derived operations in the model. The fundamental division (141 
cannot be dropped without losing the expressive power of the relational algebra 
since in general we cannot introduce universal quantifiers using the existential 
ones. On the other hand, there may be ways to simplify the present relational 
algebra by considering other forms of division-like operations. One way to go 
is to introduce graded subsethood as a fundamental (graded) comparator of 
relations, and use analogous techniques as image relations m to express the 
division. 
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